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Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of cMms in the application: 
Listing of Claims; 

1. (Currently Amended) A system that facilitates extracting data in 
connection with spam processing, comprising: 

a computer readable storage medium comprising: 

a component that receives a message and extracts a set of features 
associated with some part, content or content type of a message; and 

an analysis component that examines consecutiveness of characters within 
a subject line of the message and a content type of the message for spam in 
connection with building a filter, wherein the content type describes data 
contained within a body of the message, the content type being i s case- 
sensitive[[,]] and compris ing rf esll a primary content-type^ e^A a secondary- 
content type, or a_combination[[s]] thereof. 

2. (Original) The system of claim 1, the analysis component determines 
frequency of consecutive repeating characters within the subject line of the message. 

3. (Original) The system of claim 2, the characters comprise letters, 
numbers, or punctuation. 

4. (Original) The system of claim 1, the analysis component determines 
the frequency of white space characters within the subject line of the message. 
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5. (Currently Amended) The system of cMm 1, the analysis component 
determines distance between at least one alpha-numeric character and a blob , wherein the blob 
comprises a random sequence of characters, numbers, punctuation, or a combination thereof . 

6. (Original) The system of claim 1, the analysis component determines 
a maximum number of consecutive, repeating characters and stores this information. 

7. (Original) The system of claim 1, the analysis component establishes 
ranges of consecutive, repeating characters, the ranges corresponding to varying degrees of 
spaminess, whereby messages can be sorted by their respective individual count of consecutive 
repeating characters. 

8. (Canceled) 

9. (Previously Presented) The system of claim 1, the analysis 
component compares the content type of a current message to stored content types of a plurality 
of other messages to facilitate determining whether the message is spam. 

10-11. (Canceled) 

12. (Original) The system of claim 1, the analysis component further 
determines time stamps associated with the message. 

13. (Previously Presented) The system of claim 12, the analysis 
component determines a delta between time stamps. 
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14. (Original) The system of claim 13, the delta is between a first and a 
last time stamp. 

15. (Original) The system of claim 1, the aneilysis component determines 
at least one of: a percentage of white space to non- white space in the subject line of the message 
and a percentage of non-white space and non-numeric characters that are not letters in the subject 

line of the message. 

16. (Original) The system of claim 1, the filter being a spam filter. 

17. (Original) The system of claim 1, the filter being a parental control 

filter. 

18. (Original) The system of claim 1, further comprising a machine 
learning system component that employs at least a subset of extracted features to learn at least 
one of spam and non- spam. 

19-41. (Canceled) 

42. (Currently Amended) A method for evaluating spam as a function of 
message content, comprising: 

employing a processor executing computer readable instructions stored on 
a computer readable storage medium to implement the following: 

parsing a message to extract a set of features associated with a part, 
content, or content type of the message, wherein the content type describes the 
type of data contained within a body of the message, the content type being [[is]] 
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case-sensitive and comprisingHes]] a primary content-type^ £Hid a secondary 
content-type, or a_combination[[s]] thereof; €Hid 

examining the extracted set of features aftd to identify consecutiveness of 
repeating characters within a subject line of the message to classify the message 
as spam or not spam and to identify a distance of white-space characters between 
at least one alpha-numeric character and a blob comprising a random sequence of 
characters, numbers, punctuation, or a combination thereof ; and 

processing the message as a function of the classification. 

43. (Previously Presented) The method of claim 42, examining the 
consecutiveness of repeating characters comprises determining a frequency of the 
consecutiveness of repeating characters, wherein the characters comprise letters, numbers, 
punctuation, or white space. 

44. (Canceled) 

45. (Previously Presented) The method of claim 42, further comprising: 
establishing ranges of consecutive, repeating characters, the ranges 

correspond to various degrees of spaminess; and 

employing the ranges to sort messages by their respective individual count 
of consecutive repeating characters. 

46. (Previously Presented) The method of claim 42, further comprising 
comparing the set of features of the message to stored content types of a plurality of other 
message to determine whether the message is spam. 
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47. (New) One or more computer-readable media having computer- 
executable instructions embodied thereon that, when executed, perform a method for facilitating 
extracting data in connection with spam processing, comprising: 
receiving a message; 

extracting a set of features associated with some part, content or content 
type of the message; 

examining consecutiveness of characters within a subject line of the 
message and identifying a distance comprising a number of white-space 
characters between at least one alpha-numeric character and a blob comprising a 
random sequence of characters, numbers, punctuation, or a combination thereof; 

examining a content type of the message for spam in connection with 
building a filter, wherein the content type describes data contained within a body 
of the message, the content type being case-sensitive to capture a variation of a 
primary content-type, a secondary-content type, or a combination thereof, each of 
the primary content-type and the secondary-content type comprising one of a text, 
a multipart, a message, an image, an audio, a video, or an application; and 

determining at least one of a percentage of white space to non- white space 
in the subject line of the message and a percentage of non-white space and non- 
numeric characters that are not letters in the subject line of the message. 
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